منابع مشابه
Hapax Legomena in the Old English Exodus: An Etymological Study
This essay serves as an attempt to examine various hapax legomena in the Old English poem Exodus from an etymological perspective in order to more conclusively determine its date of composition. Exodus is somewhat unique among Old English epic poems in its proportion ofhapax legeomena, having over 150 examples in under 600 surviving lines. Its date of composition is ofparticular scholarly inter...
متن کاملAn Asymptotic Model for the English Hapax/Vocabulary Ratio
In the known literature, hapax legomena in an English text or a collection of texts roughly account for about 50% of the vocabulary. This sort of constancy is baffling. The 100-millionword British National Corpus was used to study this phenomenon. The result reveals that the hapax/vocabulary ratio follows a U-shaped pattern. Initially, as the size of text increases, the hapax/vocabulary ratio d...
متن کاملEstimating Lexical Priors for Low-Frequency Syncretic Forms
Abstract Given a previously unseen form that is morphologically n-ways ambiguous, what is the best estimator for the lexical prior probabilities for the various functions of the form? We argue that the best estimator is provided by computing the relative frequencies of the various functions among the hapax legomena — the forms that occur exactly once in a corpus. This result has important impli...
متن کاملHapax - Enriching Reverse Engineering with Semantic Clustering
Many reverse engineering approaches focus on structural information and ignore semantic information like the naming of identifiers or comments. But developers put their domain knowledge into exactly these parts of the source code. Without understanding the semantics of the code, one cannot tell its meaning. We use Latent Semantic Indexing, an information retrieval technique [3], to retrieve the...
متن کاملMultiword Vocabulary in Japanese ESL Texts
In this paper we describe our analysis of vocabulary across three sets of Japanese ESL texts. We focus upon frequency analysis of individual words and multiword sequences (n-grams), giving cross comparisons of 2, 3 and 4-gram multiword sequences. In addition, we consider the degree of emphasis on multiword vocabulary that is evident in each textbook corpus. This is derived from analysis of the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Orient
سال: 1991
ISSN: 1884-1392,0473-3851
DOI: 10.5356/orient1960.27.36